All Questions
26 questions
0votes
0answers
19views
CLIPSeg: no change in performance metrics with a better convolutional decoder
I am training CLIPSeg on the Oxford IIIT pet dataset for semantic segmentation (3 classes: background, cat, dog). In short, what I do is I stick a decoder on the CLIP encoder. The encoder outputs: ...
0votes
0answers
31views
Why is training for the game 2048 not working well for me?
So i started training neural networks for the game 2048. Here is my code for using DQN algorithms to train. The gymnasium env code should be OK, so im not putting it here, but if you need, here is the ...
0votes
0answers
60views
How to solve the issue with getting free ports in Pytorch DDP?
I am facing issues with getting a free port in the DDP setup block of PyTorch for parallelizing my deep learning training job across multiple GPUs on a Linux HPC cluster. I am trying to submit a deep ...
0votes
0answers
68views
Convert sciBERT to GGUF
I want to use the SciBERT weights in Ollama. Ollama accepts GGUF format, whereas SciBERT is in another format. I downloaded SciBERT from this huggingface link. I tried to convert it with llama.cpp ...
2votes
0answers
109views
Solving the 2D Helmholtz Equation over a sphere with a PINN
Background: I am working on implementing solutions to various partial differential equations through Physics-informed neural networks (PINNs). I have started working on the Helmholtz equation and was ...
2votes
1answer
124views
Custom Loss Function Traps Network in Local Optima
I am working with a feedforward neural network to fit the following simple function: N(1) = -1 N(2) = -1 N(3) = 1 N(4) = -1 But I don't want to use the Mean-...
0votes
0answers
38views
PPO with multiple actions per action vector
I would like to have the following vector for example [0.2,0.6,0.3,0.4,0.8] end up looking like this after training [0,1,0,0,1]. In other words , rather than choosing one action, I'm choosing more ...
0votes
0answers
59views
PyTorch and Opacus for Differential Privacy
When testing an example code from the TensorFlow website using Jupyter Notebook, which is available at the following link: [LINK_1], I encountered an error. You can find my question about that error ...
0votes
0answers
22views
Spikes in Loss During Training both train/val datasets with LSTM
I'm seeing good results I think, but I want to understand why these spikes in loss are occuring. As you can see, it would appear that my training is working as it should, but every 200 or so epochs ...
0votes
0answers
61views
SSIM in validation higher then SSIM in training for image denoising
I'm working to denoise microscopy images using a 2D U-Net. I'm training my network on images taken at different z-levels, and these images have ground truth, which is the mean of the images in z. ...
0votes
1answer
148views
How to Create a 1D Embedding from Tensors of Varying Sizes?
I am a newbie in AI and playing with some computer vision algorithms. I have three tensors with different sizes. Noise augmentation levels tensor with size (N, C, H, W), diffusion timestep tensor of ...
1vote
2answers
106views
How to Represent Boardless Board Game as Input to RL Model?
I am currently doing my thesis project by creating an Imitation Learning (IL) agent that learns to play the board game Hive, which lacks a traditional 2D board. Pieces are placed relative to one ...
2votes
1answer
1kviews
How do I start implementing a deep learning paper from scratch?
I am fairly new to deep learning. I want to implement a deep learning paper from scratch with proper data preprocessing, model, losses etc., using an object-oriented approach in python. I want to do ...
1vote
0answers
482views
Where can I get Imagenet test dataset labels for evaluation
I have the imagenet train, validation and test set. I have been able to assign each image in the validation set into its respective class folders with the help of some online resources. However, for ...
0votes
1answer
164views
How to improve the performance when no shuffling of dataloader is needed?
I'm currently doing some researches on video recognition. What I'm trying to do is like this paper. The idea is that: for processing a specific input video clip (shape: [T, C, H, W]), it needs ...